Revisiting Accelerator-Based CMPs: Challenges and Solutions

نویسندگان

  • Nasibeh Teimouri
  • Hamed Tabkhi
  • Gunar Schirner
چکیده

Heterogeneous Chip MultiProcessors (CMP)s, which combine processor cores with specialized HW accelerators, are one main approach to high-performance low-power computing. While it is promising for few accelerators, the scalability is a major challenge with increasing number of accelerators. Resources including memory, communication fabric and processor turn into bottlenecks and result in accelerator under-utilization and cripple the performance. This paper analyzes the scalability of heterogeneous CMPs with many accelerators and identifies bottlenecks and their impacts on system performance. It introduces an analytical method for scalability/bottleneck analysis that is backed up by a simulation-based performance analysis (using automatically generated virtual platforms). This paper proposes a novel architecture template: Transparent Self-Synchronizing (TSS) accelerators for efficient/scalable realization of streaming applications. TSS achieves the efficiency / scalability through configurable point-to-point connections and self synchronization between HW accelerators and efficient management of accelerator’s memory. This article demonstrates the TSS benefits using both analytical and simulation methods. TSS significantly reduces the pressure on the communication fabric, processor load, and memory requirements to improve scalability. Even with increasing number of accelerators, TSS can achieve more than 85% accelerator utilization. In contrast, in ACC-based CMPs the accelerator utilization drops fast; less than 40% with six accelerators or even worse with more accelerators. The scalability benefits of TSS are more pronounced as the number of hardware accelerators increases.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

AXR-CMP: Architecture Support in Accelerator-Rich CMPs

To improve performance/power efficiency, we expect that future CMPs may use special-purpose accelerators extensively. This work discusses hardware architectural support for accelerator-rich CMPs. First, we introduce an efficient cache management scheme for accelerators to mitigate memory latency by overlapping data transfer with computation. Second, we present a hardware resource management sch...

متن کامل

The Intersection of Massage Practice and Research: Community Massage Therapists as Research Personnel on an NIH-funded Effectiveness Study

INTRODUCTION Few NIH funded studies give community massage therapists the opportunity to become study personnel. A recent NIH/NCCAM-funded study investigating chronic low back pain (CLBP) recruited, trained, and utilized community massage practitioners (CMPs) as study personnel. This study's aim was to determine whether health-related outcomes for CLBP improve when patients are referred from pr...

متن کامل

Debugger for Multi-level Hybrid Parallel Programs on Heterogeneous Accelerator Cluster Architectures – Survey and Challenges

The need to debug hybrid parallel programs on heterogeneous accelerator clusters opens a new set of challenges for concurrently managing the processes and threads at node and accelerator levels. Currently, there exist open source debuggers for traditional HPC clusters, which support debugging of multi-node parallel programs. At present, debugging at the accelerator level is handled through lang...

متن کامل

Revisiting Missing Identity Ring of Iranian cities (A spatial Temporal Analysis of square Elements in Islamic Architecture and urbanism)

From long ago, square has been considered as a space for performing area of cities, and it has been a factor determining the identity of cities through its design and structures. However, with the growth of cities and arrival of modernity management challenges are facing cities. Accordingly, cities have gradually changed into a place for predicting different types of technological, conceptual...

متن کامل

Ranking of fuzzy numbers based on angle measure

In this paper, a novel approach for ranking fuzzy numbers based on the angle measure is introduced. Several left and right spreads at each chosen levels of fuzzy numbers is used to determine center of mass points(CMPs) and then, the angels between the CMPs and the horizontal axis is calculated. The total angle is determined by averaging the computed angles and finally, the novel method is compa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015